Hardware Accelerator Design of DCT Algorithm With Unique-Group Cosine Coefficients for Mel-Scale Frequency Cepstral Coefficients

نویسندگان

چکیده

This study presents a compact L-points discrete cosine transform (DCT) hardware accelerator for M-points Mel-scale Frequency Cepstral Coefficients (MFCC). The main contributions of this work can be summarized as 1) proposing an algorithm with lower complexity; 2) achieving higher accuracy performance; 3) implementing low-cost unique group coefficients. For derivation, the proposed method converts original formula into type IV (DCT-IV) preprocessing procedure. kernel computation DCT-IV further derived same multiplication preprocessing. Therefore, total (M-1) (L-1) additions, L multiplications, and coefficients are required computation. Compared Jo et al.’s algorithm, respectively reduces number additions multiplications by 42.32 % 41.67 %. Instead, is increased 33.33 Moreover, exhibits peak signal-to-noise ratio (PSNR) value which achieved at 90.1dB 16-bit coefficient word length. realization, FPGA implementation results show that it operate clock rate 135.85 MHz requires only 113 combinational elements, 87 registers, 3 DSP multipliers, 64×16 bits RAM 32×16 ROM. Overall, would good choice integrating MFCC applications in future.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Mel Frequency Cepstral Coefficients for Music Modeling

We examine in some detail Mel Frequency Cepstral Coefficients (MFCCs) the dominant features used for speech recognition and investigate their applicability to modeling music. In particular, we examine two of the main assumptions of the process of forming MFCCs: the use of the Mel frequency scale to model the spectra; and the use of the Discrete Cosine Transform (DCT) to decorrelate the Mel-spec...

متن کامل

The Capacity of Mel Frequency Cepstral Coefficients for Speech Recognition

Speech recognition is of an important contribution in promoting new technologies in human computer interaction. Today, there is a growing need to employ speech technology in daily life and business activities. However, speech recognition is a challenging task that requires different stages before obtaining the desired output. Among automatic speech recognition (ASR) components is the feature ex...

متن کامل

Multiple Time Resolutions for Derivatives of Mel-frequency Cepstral Coefficients

Most speech recognition systems are based on melfrequency cepstral coefficients and their firstand secondorder derivatives. The derivatives are normally approximated by fitting a linear regression line to a fixed-length segment of consecutive frames. The time resolution and smoothness of the estimated derivative depends on the length of the segment. We present an approach to improve the represe...

متن کامل

Speech reconstruction from mel frequency cepstral coefficients and pitch frequency

This paper presents a novel low complexity, frequency domain algorithm for reconstruction of speech from the melfrequency cepstral coe cients (MFCC), commonly used by speech recognition systems, and the pitch frequency values. The reconstruction technique is based on the sinusoidal speech representation. A set of sine-wave frequencies is derived using the pitch frequency and voicing decisions, ...

متن کامل

Using Mel-Frequency Cepstral Coefficients in Missing Data Technique

Filter bank is the most common feature being employed in the research of the marginalisation approaches for robust speech recognition due to its simplicity in detecting the unreliable data in the frequency domain. In this paper, we propose a hybrid approach based on the marginalisation and the soft decision techniques that make use of the Mel-frequency cepstral coefficients (MFCCs) instead of f...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Access

سال: 2022

ISSN: ['2169-3536']

DOI: https://doi.org/10.1109/access.2022.3194261